NSF PAR Search | NSF Public Access Repository


SamplingDesign: RNA design via continuous optimization with coupled variables and Monte-Carlo sampling

https://doi.org/10.1038/s41467-025-67901-3

Tang, Wei Yu; Dai, Ning; Zhou, Tianshuo; Mathews, David H; Huang, Liang (February 2026, Nature Communications)

RNA design aims to find a sequence that can fold into a target secondary structure. It can create artificial RNA molecules for specific functions, with wide applications in medicine. It is computationally challenging due to two levels of combinatorial explosion: the exponentially large design space and the exponentially many competing structures per design. Popular methods such as local search cannot keep up with these combinatorial explosions. We instead employ two techniques from machine learning, continuous optimization and Monte-Carlo sampling. We start from a distribution over all valid sequences, and use gradient descent to improve the expectation of an arbitrary objective function. We define novel coupled-variable distributions to model the correlation between nucleotides. We then use sampling to approximate the objective, estimate the gradient, and select the final candidate. Our work consistently outperforms state-of-the-art methods in key metrics including Boltzmann probability and ensemble defect, especially on long and hard-to-design structures.
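As a rough illustration of the idea in this abstract (optimize a distribution over sequences, and use samples to estimate both the objective and its gradient), here is a minimal sketch. It is not the authors' implementation: it assumes independent per-position distributions instead of the paper's coupled variables, uses a score-function (REINFORCE-style) gradient estimate, and replaces the folding objective with a hypothetical stand-in, `toy_objective`, that counts matches to a fixed string. All function names and parameters are invented for this example.

```python
import math
import random

NUCS = "ACGU"

def softmax(logits):
    """Turn a list of logits into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sample_sequence(probs, rng):
    """Draw one sequence; each position is sampled independently (an assumption)."""
    return "".join(rng.choices(NUCS, weights=p)[0] for p in probs)

def toy_objective(seq, target):
    """Hypothetical stand-in for a real folding objective: fraction of matching positions."""
    return sum(a == b for a, b in zip(seq, target)) / len(target)

def optimize(target, steps=300, samples=32, lr=2.0, seed=0):
    rng = random.Random(seed)
    logits = [[0.0] * len(NUCS) for _ in target]  # start from the uniform distribution
    for _ in range(steps):
        probs = [softmax(row) for row in logits]
        batch = [sample_sequence(probs, rng) for _ in range(samples)]
        scores = [toy_objective(s, target) for s in batch]
        baseline = sum(scores) / samples  # batch mean, used to reduce variance
        for seq, score in zip(batch, scores):
            weight = (score - baseline) / samples
            for i, ch in enumerate(seq):
                k = NUCS.index(ch)
                for j in range(len(NUCS)):
                    # d log p(seq) / d logit[i][j] = 1[j == k] - probs[i][j]
                    indicator = 1.0 if j == k else 0.0
                    logits[i][j] += lr * weight * (indicator - probs[i][j])
    # Select the final candidate by sampling from the optimized distribution.
    probs = [softmax(row) for row in logits]
    final = [sample_sequence(probs, rng) for _ in range(samples)]
    return max(final, key=lambda s: toy_objective(s, target))

best = optimize("GGGAAACC")
print(best, toy_objective(best, "GGGAAACC"))
```

The sketch ascends the expected score; minimizing a cost such as ensemble defect would only flip the sign of the update.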
Full Text Available
Emerging Higher-Carbon Nitrogenous Disinfection Byproducts: A Brief Review of Structures, Occurrence, and Research Needs

https://doi.org/10.1016/j.coesh.2025.100690

Mohamadi, Siavash; Werner, Christian A; Dai, Ning (November 2025, Current Opinion in Environmental Science & Health)

Full Text Available
Synergistically combining peracetic acid and reduced graphene oxide membranes to degrade trace organic contaminants

https://doi.org/10.1016/j.cej.2025.164302

Deng, Erda; Kralles, Zachary T; Mohamadi, Siavash; Das, Sagnik; Dias, Ruveen; Dai, Ning; Lin, Haiqing (August 2025, Chemical Engineering Journal)

Full Text Available
EnsembleDesign: messenger RNA design minimizing ensemble free energy via probabilistic lattice parsing

https://doi.org/10.1093/bioinformatics/btaf245

Dai, Ning; Zhou, Tianshuo; Tang, Wei Yu; Mathews, David H; Huang, Liang (July 2025, Bioinformatics)

Motivation: The task of designing optimized messenger RNA (mRNA) sequences has received much attention in recent years, thanks to breakthroughs in mRNA vaccines during the COVID-19 pandemic. Because most previous work aimed to minimize the minimum free energy (MFE) of the mRNA in order to improve stability and protein expression, which only considers one particular structure per mRNA sequence, millions of alternative conformations in equilibrium are neglected. More importantly, we prefer an mRNA to populate multiple stable structures and be flexible among them during translation when the ribosome unwinds it. Results: Therefore, we consider a new objective to minimize the ensemble free energy of an mRNA, which includes all possible structures in its Boltzmann ensemble. However, this new problem is much harder to solve than the original MFE optimization. To address the increased complexity of this problem, we introduce EnsembleDesign, a novel algorithm that employs continuous relaxation to optimize the expected ensemble free energy over a distribution of candidate sequences. EnsembleDesign extends both the lattice representation of the design space and the dynamic programming algorithm from LinearDesign to their probabilistic counterparts. Our algorithm consistently outperforms LinearDesign in terms of ensemble free energy, especially on long sequences. Interestingly, as byproducts, our designs also enjoy lower average unpaired probabilities (which correlates with degradation) and flatter Boltzmann ensembles (more flexibility between conformations). Availability and implementation: Our code is available at https://github.com/LinearFold/EnsembleDesign.
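To make the difference between the two objectives concrete, the short sketch below compares the minimum free energy with the ensemble free energy, using the standard relation G = -RT ln Z with Z the sum of exp(-E/RT) over structures. The list of structure energies is made up for illustration, and a real tool would compute Z by dynamic programming over all structures; this is not EnsembleDesign's code.

```python
import math

# RT in kcal/mol at 37 °C: R = 0.0019872 kcal/(mol*K), T = 310.15 K
RT = 0.0019872 * 310.15

def ensemble_free_energy(energies, rt=RT):
    """Return -RT * ln(sum_i exp(-E_i / RT)) for the given structure energies."""
    lo = min(energies)
    # Factor out the lowest energy so the exponentials cannot overflow.
    z_scaled = sum(math.exp(-(e - lo) / rt) for e in energies)
    return lo - rt * math.log(z_scaled)

# Hypothetical free energies (kcal/mol) of three competing structures.
energies = [-10.0, -9.5, -9.0]
print("MFE:", min(energies))
print("Ensemble free energy:", ensemble_free_energy(energies))
```

Because every structure contributes a positive term to Z, the ensemble free energy is never higher than the MFE, and the two coincide only when a single structure is present.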
Full Text Available
Halogenation of Anilines: Formation of Haloacetonitriles and Large-Molecule Disinfection Byproducts

https://doi.org/10.1021/acs.est.4c05434

Kralles, Zachary T; Deherikar, Prashant K; Werner, Christian A; Hu, Ximin; Kolodziej, Edward P; Dai, Ning (October 2024, Environmental Science & Technology)

Full Text Available
LinearAlifold: Linear-time consensus structure prediction for RNA alignments

https://doi.org/10.1016/j.jmb.2024.168694

Malik, Apoorv; Zhang, Liang; Gautam, Milan; Dai, Ning; Li, Sizhen; Zhang, He; Mathews, David H; Huang, Liang (July 2024, Journal of Molecular Biology)

Full Text Available
In situ oxidation of reduced graphene oxide membranes by peracetic acid for dye desalination

https://doi.org/10.1016/j.memsci.2024.122991

Deng, Erda; Chen, Kai; Quigley, Aubrey E; Yuan, Mengqi; Zhu, Lingxiang; Kralles, Zachary T; Freeman, Benny D; Dai, Ning; Lin, Haiqing (July 2024, Journal of Membrane Science)

Full Text Available
Adsorption behavior of long-chain perfluoroalkyl substances on hydrophobic surface: A combined molecular characterization and simulation study

https://doi.org/10.1016/j.watres.2023.120074

Mohona, Tashfia M; Ye, Zhijiang; Dai, Ning; Nalam, Prathima C (July 2023, Water Research)

Full Text Available
LinearCoFold and LinearCoPartition: linear-time algorithms for secondary structure prediction of interacting RNA molecules

https://doi.org/10.1093/nar/gkad664

Zhang, He; Li, Sizhen; Dai, Ning; Zhang, Liang; Mathews, David H; Huang, Liang (August 2023, Nucleic Acids Research)

Many RNAs function through RNA–RNA interactions. Fast and reliable RNA structure prediction with consideration of RNA–RNA interaction is useful; however, existing tools are either too simplistic or too slow. To address this issue, we present LinearCoFold, which approximates the complete minimum free energy structure of two strands in linear time, and LinearCoPartition, which approximates the cofolding partition function and base pairing probabilities in linear time. LinearCoFold and LinearCoPartition are orders of magnitude faster than RNAcofold. For example, on a sequence pair with combined length of 26,190 nt, LinearCoFold is 86.8× faster than RNAcofold MFE mode, and LinearCoPartition is 642.3× faster than RNAcofold partition function mode. Surprisingly, LinearCoFold and LinearCoPartition’s predictions have higher PPV and sensitivity of intermolecular base pairs. Furthermore, we apply LinearCoFold to predict the RNA–RNA interaction between SARS-CoV-2 genomic RNA (gRNA) and human U4 small nuclear RNA (snRNA), which has been experimentally studied, and observe that LinearCoFold’s prediction correlates better with the wet lab results than RNAcofold’s.
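The accuracy metrics named in this abstract follow the usual definitions: PPV is the fraction of predicted base pairs that are correct, and sensitivity is the fraction of reference base pairs that are recovered. A minimal sketch is shown below; the function name and the example pair lists are hypothetical, and the paper's own evaluation protocol may differ in detail.

```python
def pair_metrics(predicted, reference):
    """Return (PPV, sensitivity) for two collections of base pairs (i, j)."""
    # Normalize each pair so that (i, j) and (j, i) count as the same pair.
    predicted = {tuple(sorted(p)) for p in predicted}
    reference = {tuple(sorted(p)) for p in reference}
    true_positives = len(predicted & reference)
    ppv = true_positives / len(predicted) if predicted else 0.0
    sensitivity = true_positives / len(reference) if reference else 0.0
    return ppv, sensitivity

# Hypothetical intermolecular pairs, indexed on the concatenated two-strand sequence.
predicted = [(1, 20), (2, 19), (3, 18), (5, 16)]
reference = [(1, 20), (2, 19), (4, 17)]
ppv, sensitivity = pair_metrics(predicted, reference)
print(ppv, sensitivity)
```

Here two of the four predicted pairs are correct and two of the three reference pairs are found.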
Full Text Available
Overlooked Contribution of the Indole Moiety to the Formation of Haloacetonitrile Disinfection Byproducts

https://doi.org/10.1021/acs.est.3c01080

Kralles, Zachary T; Werner, Christian A; Dai, Ning (May 2023, Environmental Science & Technology)

Full Text Available
